Deep Learning and Text Mining: Classifying and Extracting Key Information from Construction Accident Narratives

نویسندگان

چکیده

Construction accidents can lead to serious consequences. To reduce the occurrence of such and strengthen execution capabilities in on-site safety management, managers must analyze accident report texts depth extract valuable information from them. However, are usually presented unstructured or semi-structured forms; analyzing these manually requires a lot time effort, it is difficult cope with demand large number texts, quality key extracted may be poor. Therefore, this study proposes classification method based on natural language processing (NLP) technology. First, we developed text model convolutional neural network (CNN) that automatically classify categories features. Next, taking classified fall as an example, narratives using term frequency-inverse document frequency (TF-IDF) visually word clouds. The results show overall accuracy CNN reaches 84%, which better than other three shallow machine-learning models. Then, eight areas accident-prone operations were identified TF-IDF algorithm. This provide important guidance for project used management help prevent production accidents.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal Classifying and Extracting Fuzzy Relationship from Query Using Text Mining Techniques

Text mining techniques are generally applied for classifying the text, finding fuzzy relations and structures in data sets. This research provides plenty text mining capabilities. One common application is text classification and event extraction, which encompass deducing specific knowledge concerning incidents referred to in texts. The main contribution of this paper is the clarification of a ...

متن کامل

A review of text mining approaches and their function in discovering and extracting a topic

Background and aim: Four text mining methods are examined and focused on understanding and identifying their properties and limitations in subject discovery. Methodology: The study is an analytical review of the literature of text mining and topic modeling.  Findings: LSA could be used to classify specific and unique topics in documents that address only a single topic. The other three text min...

متن کامل

Domain specific text mining almost from scratch with Deep Learning

Deep learning will help us to better design text mining applications, but perhaps not remove the computational linguistic design process associated with text mining applications (Manning, 2015). There has been extensive work on applying deep learning algorithms to different text mining applications such as information retrieval (IR) and information extraction (IE) and so far they have improved ...

متن کامل

Extracting Financial Information from Text Documents

The majority of electronic data today is in textual form. Financial data such as articles in the Wall Street Journal are written as texts. These electronic documents contain a wealth of information but require human interpretation. For financial analysis, rapid up-to-date information is critical. Most software tools currently require data which are better structured than text (such as data in r...

متن کامل

Extracting medication information from clinical text

The Third i2b2 Workshop on Natural Language Processing Challenges for Clinical Records focused on the identification of medications, their dosages, modes (routes) of administration, frequencies, durations, and reasons for administration in discharge summaries. This challenge is referred to as the medication challenge. For the medication challenge, i2b2 released detailed annotation guidelines al...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Applied sciences

سال: 2023

ISSN: ['2076-3417']

DOI: https://doi.org/10.3390/app131910599